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T 1: X98075. Hepatitis B virus.. .[gi:1914710] Related Sequences, Protein, PubMed, Taxonomy 



LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 
FEATURES 

source 



CDS 



gene 
CDS 



HBVDEFVP2 3297 bp DNA circular VRL 21-JUL-1997 
Hepatitis B virus complete genome with insertion in core promoter, 
198bp insertion in X-ORF. 
X98075 

X98075.1 GI:1914710 

complete genome; core protein; polymerase; PreSl gene; S-protein. 
Hepatitis B virus. 
Hepatitis B vir us 

Viruses; Retroid viruses; Hepadnaviridae ; Orthohepadnavirus . 

1 (bases 1 to 3297) 

Pult,I., Chouard,T., Wieland,S., Klemenz,R., Yaniv,M. and Blum, H.E. 
A hepatitis B virus mutant with a new hepatocyte nuclear factor 1 
binding site emerging in transplant-transmitted fulminant hepatitis 
B 

Hepatology 2 5 (6), 1507-1515 (1997) 
97329263 

2 (bases 1 to 3297) 
Pult, I . 

Direct Submission 

Submitted ( 16-MAY-1996 ) I. Pult, Department of Pathology, 
Laboratory of Molecular Medicine, G Lab 14, Schmelzbergstrasse 12, 
CH-8091 Zurich, SWITZERLAND 

Related sequences: D00329, D00330 and D00331. 
Location/Qualifiers 
1..3297 

/organism^ "Hepatitis B virus" 
/strain^" subtype adw" 
/db_xref="taxon: 10407" 

/note=" complete genome; defective viral particle" 

155 . .835 

/codon_start=l 

/product=" S-protein" 

/protein id= " CAA66686 . 1 " 

/db_xref="GI: 1914711" 

/db_xref = " SPTREMBL : 012403 " 

/ trans 1 a t i on= " MES I ASGLPGPLLVLQAGFFLLTKILTI PQSLDSWWTSLNFLGG 
TPVCLGQNSQSQISSHSPTCCPPICPGYRWMCLRRFIIFLCILLLCLIFLLVLLDYQG 
MLPVCPLIPGSSTTSTGPCKTCTAPAQGTSMFPSCCCTKPTDGNCTCIPIPSSWAFAK 
YLWEWASVRFSWLSLLVPFVQWFVGLSPTVWLSVIWMMWFWGPSLYNILSPFIPLLPI 

FFCLWVYI " 

1374 . . 1787 

/gene=" X-ORF " 

1374 . . 1787 

/gene= "X-ORF " 

/note=" 198 bp insertion" 

/codon_start=l 

/•protein id= " CAA66 687 . 1 " 

/db_xref ="GI : 1914712 " 



http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=nucleo 10/22/2001 
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BASE COUNT 7 58 

ORIGIN 

1 ttccaccact 

61 tggtggctcc 

121 aatcttatcg 

181 aggacccctg 

241 acagagtcta 

301 tggccaaaat 

361 tcctggttat 

421 atgcctcatc 

481 aattccagga 

541 aggaacctct 

601 tattcccatc 

661 tttctcttgg 

721 tgtctggctt 

781 gagtcccttt 

841 cacaaaacaa 

901 ggcacattgc 

961 gtaaacaggc 

1021 gcccctttca 

1081 aaacaggctt 



/ db_xr e f = " S PTREMBL : 0 1 2 4 0 9 " 

/ trans la t ion = " MAARLCCQLDPARDVLCLRPVGAESRGRPLPGPLGALPPASPPV 

VPTDHGAHLSLRGLPVCAFSSAGPCALRFTSARRMETTVNAHRNLPKVLHKRTLGLST 

MSTTDLEAYFKDCVFTEWEELGEEVRLKVFVLVNH " 

1776 . . 1786 

/gene="X-ORF" 

/note= " insertion in the core promoter" 

/bound_moiety="hepatocyte nuclear factor 1" 

2110 . .2655 

/codon_start=l 

/product=*'core protein" 

/protein .id= " CAA66688 . 1 " 

/db_xref ="GI : 1914713 " 

/ db_x r e f = " S PTREMBL : 0 0 9 5 1 2 " 

/translation= "MDIDTYKEFGASVELLSFLPSDFFPSIRDPLDTATALHREALES 

PEHCSPHHTALRQAIVCWGELMNLATWVGSNLEDPASRELVVSYVNVNMGLKIRQLLW 

FHISCLTFGRETVLEYLVSFGVWIRTPPAYRPPNAPILSTLPETTWRRRGRSPRRRT 

PSPRRRRSQSPRRRRSQSRES n 

2516. .3160 

/codon_start=l 

/produc t= "polymerase " 

/protein id= " CAA66689 . 1 " 

/db_xref = "GI : 1914714 " 

/db_xref = " S PTREMBL : 009513 " 

/ trans la t i on= " MPLSYQHFRKLLLLDEEAGPLEEELPRLAEEGLNRRVAEDLNLG 

NLNVSIPWTHKVGNFTGLYSSWPCFNPKWQTPSFPDIHLQEDIVDRCKQFVGPLTVN 

ENRRLKLIMPARFYPNVTKYLPLDKGIKPYYPEYWDHYFQTRHYLHTLWKAGILYKR 

ESTRSASFCGSPYSWEQDLQHTSKRHGDESFCPQSLGFFPDHQLDPAFKANSLG " 

3072.. 3266 

/gene= "preSl " 

3072 . . 3266 

/gene="preSl" 

/note=" 18 bp and 108 bp deletion" 

/codon_start=l 

/protein id=" CAA66690 . 1 " 

/db_xref = "GI : 1945151 " 

/db_xref = " S PTREMBL : 009514 " 

/ 1 r ans 1 a t i on= " MGTNLS VPNPWDS S P 1 1 SWTLH SKPTRWGRALRLRAYSQLCQQL 
LLLPPPIGSQEGSLLPYLHL " 
a 859 c 751 g 929 t 



ttccaccaaa 
agttcaggaa 
aagactgggg 
ctcgtgttac 
gactcgtggt 
tcgcagtccc 
cgctggatgt 
ttcttgttgg 
tcatcaacca 
atgtttccct 
ccatcatctt 
ctcagtttac 
tcagttatat 
ataccgctgt 
aaagatgggg 
cacaggaaca 
ctattgattg 
cgcaatgtgg 
ttactttctc 



ctcttcaaga 
cagtgagccc 
accctgtgcc 
aggcggggtt 
ggacttctct 
aaatctccag 
gtctgcggcg 
ttcttctgga 
ccagcacggg 
catgttgctg 
gggctttcgc 
tagtgccatt 
ggatgatgtg 
taccaatttt 
atattccctt 
tattgtacaa 
gaaagtatgt 
atatcctgct 
gccaacttac 



tcccagagtc 

tgctcagaat 

gaacatggag 

tttcttgttg 

caattttcta 

tcactcacca 

ttttatcatc 

ctatcaaggt 

accatgcaag 

tacaaaacct 

aaaataccta 

tgttcagtgg 

gttttggggg 

cttttgtctt 

aacttcatgg 

aaaatcaaaa, 

caacgaattg 

ttaatgcctt 

aaggcctttc 



agggccctgt 
actgtctctg 
agcatcgcat 
acaaaaatcc 

gggggaacac 

acctgttgtc 
ttcctctgca 
atgttgcccg 
acctgcacag 
acggacggaa 
tgggagtggg 
ttcgtagggc 
ccaagtctgt 
tgggtataca 
gatatgtaat 
tgtgttttag 
tgggtctttt 
tatatgcatg 
taagtaaaca 



accttcctgc 
ccatatcgtc 
caggactccc 
tcacaatacc 
ccgtgtgtct 
ctccaatttg 
tcctgctgct 
tttgtcctct 
ctcctgctca 
actgcacctg 
cctcagtccg 
tttcccccac 
acaacatctt 
tttaaaccct 
tgggagttgg 
gaaacttcct 
ggggtttgcc 
tatacaagca 
gtatctgaac 
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1141 ctttaccccg ttgctcggca acggcctggt ctgtgccaag tgtttgctga cgcaaccccc 
1201 actggttggg gcttggccat aggccatcag cgcatgcgtg gaacctttgt gtctcctctg 
1261 ccgatccata ctgcggaact cctagccgct tgttttgctc gcagcaggtc tggggcaaaa 
1321 ctcatcggga ctgacaattc tgtcgtgctc tcccgcaagt atacatcatt tccatggctg 
1381 ctaggctgtg ctgccaactg gatcctgcgc gggacgtcct ttgtttacgt cccgtcggcg 
1441 ctgaatcccg cggacgaccc ctcccggggc cgcttggggc tctaccgccc gcttctccgc 
1501 ctgttgtacc gaccgaccac ggggcgcacc tctctttacg cggactcccc gtctgtgcct 
1561 tctcatctgc cggaccgtgt gcacttcgct tcacctctgc acgtcgcatg gaaaccaccg 
1621 tgaacgccca caggaacctg cccaaggtct tgcataagag aactcttgga ctttcaacaa 
1681 tgtcaacgac cgaccttgag gcatacttca aagactgtgt gtttactgag tgggaggagt 
1741 tgggggagga ggttaggtta aaggtctttg tactagttaa tcattaggag gctgtaggac 
1801 gtcgcatgga aaccaccgtg aacgcccaca ggaacctgcc caaggtcttg cataagagaa 
1861 ctcttggact ttcaacaatg tcaacgaccg accttgaggc atacttcaaa gactgtgtgt 
1921 ttactgagtg ggaggagttg ggggaggagg ttaggttaaa ggtctttgta ctagttaatc 
1981 attaggaggc tgtaggcata aattggtgtg ttcaccagca ccatgcaact ttttcacctc 
2041 tgcctaatca tctcttgttc atgtcctact gttcaagcct ccaagctgtg ccttgggtgg 
2101 ctttggggca tggacattga cacgtataaa gaatttggag cttctgtgga gttactctct 
2161 tttttgcctt ctgacttctt tccttctatt cgggatcccc tcgacaccgc cactgctctg 
2221 catcgggagg ccttagagtc tccggaacat tgttcacctc accatacggc actcaggcaa 
2281 gctattgtgt gttggggtga gttgatgaat ctagccacct gggtgggaag taatttggaa 
2341 gatccagcat ccagggaatt agtagtcagc tatgtcaacg ttaatatggg cctaaaaatc 
2401 agacaactat tgtggtttca catttcctgt cttacgtttg ggagagaaac tgttcttgaa 
2461 tatttggtgt cctttggagt gtggattcgc actcctcctg catacagacc accaaatgcc 
2521 cctatcttat caacacttcc ggaaactact gttgttagac gaagaggcag gtcccctaga 
2581 agaagaactc cctcgcctcg cagaagaagg tctcaatcgc cgcgtcgcag aagatctcaa... - 
2641 tctcgggaat cttaatgtta gtattccttg gacacataag gtgggaaact ttacggggct 
2701 ttattcttct acggtacctt gctttaatcc taaatggcaa actccttctt ttcctgacat 
2761 tcatttgcag gaggacattg ttgatagatg taagcaattt gtggggcccc ttacagtaaa 
2821 tgaaaacagg agactaaaat taattatgcc tgctaggttt tatcccaatg ttactaaata 
2881 tttgccctta gataaaggga tcaaaccgta ttatccagag tatgtagttg atcattactt 
2941 ccagacgcga cattatttac acactctttg gaaagcgggg atcttatata aaagagagtc 
3001 cacacgtagc gcctcatttt gcgggtcacc atattcttgg gaacaagatc tacagcatac 
3061 ctcgaaaagg catggggacg aatctttctg tccccaatcc ctgggattct tccccgatca 
3121 tcagttggac cctgcattca aagccaactc gttggggtag agccctcagg ctcagggcct 
3181 actcacaact gtgccagcag ctcctcctcc tgcctccacc aatcggcagt caggaaggca 
3241 gcctactccc ttatctccac ctctaagaga cactcatcca caggccatgc agtggaa 

// 
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